Overview
Brought to you by YData
Dataset statistics
| Number of variables | 33 |
|---|---|
| Number of observations | 57259 |
| Missing cells | 1145 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 3837 |
| Duplicate rows (%) | 6.7% |
| Total size in memory | 14.9 MiB |
| Average record size in memory | 273.0 B |
Variable types
| Categorical | 14 |
|---|---|
| Numeric | 15 |
| Text | 1 |
| Unsupported | 1 |
| DateTime | 2 |
| Dataset has 3837 (6.7%) duplicate rows | Duplicates |
agent is highly overall correlated with hotel | High correlation |
arrival_date_month is highly overall correlated with arrival_date_week_number | High correlation |
arrival_date_week_number is highly overall correlated with arrival_date_month | High correlation |
assigned_room_type is highly overall correlated with reserved_room_type | High correlation |
distribution_channel is highly overall correlated with market_segment | High correlation |
hotel is highly overall correlated with agent | High correlation |
is_canceled is highly overall correlated with reservation_status | High correlation |
market_segment is highly overall correlated with distribution_channel | High correlation |
reservation_status is highly overall correlated with is_canceled | High correlation |
reserved_room_type is highly overall correlated with assigned_room_type | High correlation |
children is highly imbalanced (79.8%) | Imbalance |
meal is highly imbalanced (53.2%) | Imbalance |
distribution_channel is highly imbalanced (60.3%) | Imbalance |
is_repeated_guest is highly imbalanced (80.4%) | Imbalance |
deposit_type is highly imbalanced (70.5%) | Imbalance |
required_car_parking_spaces is highly imbalanced (80.3%) | Imbalance |
customer_type has 575 (1.0%) missing values | Missing |
adults is highly skewed (γ1 = 24.87432688) | Skewed |
babies is highly skewed (γ1 = 25.35858887) | Skewed |
previous_cancellations is highly skewed (γ1 = 21.01126487) | Skewed |
company is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
lead_time has 3511 (6.1%) zeros | Zeros |
stays_in_weekend_nights has 22722 (39.7%) zeros | Zeros |
stays_in_week_nights has 3486 (6.1%) zeros | Zeros |
babies has 56477 (98.6%) zeros | Zeros |
previous_cancellations has 56224 (98.2%) zeros | Zeros |
previous_bookings_not_canceled has 55492 (96.9%) zeros | Zeros |
booking_changes has 47909 (83.7%) zeros | Zeros |
agent has 8618 (15.1%) zeros | Zeros |
days_in_waiting_list has 54921 (95.9%) zeros | Zeros |
adr has 917 (1.6%) zeros | Zeros |
total_of_special_requests has 36106 (63.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-09-21 17:52:57.063026 |
|---|---|
| Analysis finished | 2025-09-21 17:53:59.218302 |
| Duration | 1 minute and 2.16 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
hotel
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| Resort Hotel | |
|---|---|
| City Hotel |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.354652 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Resort Hotel |
|---|---|
| 2nd row | Resort Hotel |
| 3rd row | Resort Hotel |
| 4th row | Resort Hotel |
| 5th row | Resort Hotel |
Common Values
| Value | Count | Frequency (%) |
| Resort Hotel | 38783 | |
| City Hotel | 18476 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hotel | 57259 | |
| resort | 38783 | |
| city | 18476 | 16.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 114518 | |
| e | 96042 | |
| o | 96042 | |
| 57259 | ||
| H | 57259 | |
| l | 57259 | |
| s | 38783 | 6.0% |
| R | 38783 | 6.0% |
| r | 38783 | 6.0% |
| C | 18476 | 2.8% |
| Other values (2) | 36952 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 478379 | |
| Uppercase Letter | 114518 | 17.6% |
| Space Separator | 57259 | 8.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 114518 | |
| e | 96042 | |
| o | 96042 | |
| l | 57259 | |
| s | 38783 | 8.1% |
| r | 38783 | 8.1% |
| i | 18476 | 3.9% |
| y | 18476 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 57259 | |
| R | 38783 | |
| C | 18476 | 16.1% |
Space Separator
| Value | Count | Frequency (%) |
| 57259 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 592897 | |
| Common | 57259 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 114518 | |
| e | 96042 | |
| o | 96042 | |
| H | 57259 | |
| l | 57259 | |
| s | 38783 | 6.5% |
| R | 38783 | 6.5% |
| r | 38783 | 6.5% |
| C | 18476 | 3.1% |
| i | 18476 | 3.1% |
Common
| Value | Count | Frequency (%) |
| 57259 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 650156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 114518 | |
| e | 96042 | |
| o | 96042 | |
| 57259 | ||
| H | 57259 | |
| l | 57259 | |
| s | 38783 | 6.0% |
| R | 38783 | 6.0% |
| r | 38783 | 6.0% |
| C | 18476 | 2.8% |
| Other values (2) | 36952 | 5.7% |
is_canceled
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 33535 | |
| 1 | 23724 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 33535 | |
| 1 | 23724 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 33535 | |
| 1 | 23724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 57259 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 33535 | |
| 1 | 23724 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 57259 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 33535 | |
| 1 | 23724 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57259 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 33535 | |
| 1 | 23724 |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 428 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.51677 |
| Minimum | 0 |
|---|---|
| Maximum | 737 |
| Zeros | 3511 |
| Zeros (%) | 6.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 17 |
| median | 69 |
| Q3 | 158 |
| 95-th percentile | 309 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 141 |
Descriptive statistics
| Standard deviation | 101.16703 |
|---|---|
| Coefficient of variation (CV) | 1.0064691 |
| Kurtosis | 0.99744677 |
| Mean | 100.51677 |
| Median Absolute Deviation (MAD) | 60 |
| Skewness | 1.204174 |
| Sum | 5755490 |
| Variance | 10234.768 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3511 | 6.1% |
| 1 | 1827 | 3.2% |
| 2 | 1051 | 1.8% |
| 3 | 902 | 1.6% |
| 4 | 799 | 1.4% |
| 5 | 715 | 1.2% |
| 7 | 658 | 1.1% |
| 6 | 631 | 1.1% |
| 12 | 510 | 0.9% |
| 10 | 506 | 0.9% |
| Other values (418) | 46149 |
| Value | Count | Frequency (%) |
| 0 | 3511 | |
| 1 | 1827 | |
| 2 | 1051 | 1.8% |
| 3 | 902 | 1.6% |
| 4 | 799 | 1.4% |
| 5 | 715 | 1.2% |
| 6 | 631 | 1.1% |
| 7 | 658 | 1.1% |
| 8 | 480 | 0.8% |
| 9 | 466 | 0.8% |
| Value | Count | Frequency (%) |
| 737 | 1 | < 0.1% |
| 709 | 1 | < 0.1% |
| 605 | 8 | < 0.1% |
| 542 | 23 | |
| 532 | 1 | < 0.1% |
| 471 | 5 | < 0.1% |
| 468 | 46 | |
| 462 | 19 | |
| 461 | 32 | |
| 460 | 3 | < 0.1% |
arrival_date_year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| 2016 | |
|---|---|
| 2015 | |
| 2017 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 30174 | |
| 2015 | 14255 | |
| 2017 | 12830 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2016 | 30174 | |
| 2015 | 14255 | |
| 2017 | 12830 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 57259 | |
| 0 | 57259 | |
| 1 | 57259 | |
| 6 | 30174 | |
| 5 | 14255 | 6.2% |
| 7 | 12830 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 229036 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 57259 | |
| 0 | 57259 | |
| 1 | 57259 | |
| 6 | 30174 | |
| 5 | 14255 | 6.2% |
| 7 | 12830 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 229036 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 57259 | |
| 0 | 57259 | |
| 1 | 57259 | |
| 6 | 30174 | |
| 5 | 14255 | 6.2% |
| 7 | 12830 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 229036 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 57259 | |
| 0 | 57259 | |
| 1 | 57259 | |
| 6 | 30174 | |
| 5 | 14255 | 6.2% |
| 7 | 12830 | 5.6% |
arrival_date_month
Real number (ℝ)
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.6900575 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.0382557 |
|---|---|
| Coefficient of variation (CV) | 0.45414493 |
| Kurtosis | -0.96336753 |
| Mean | 6.6900575 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.14721126 |
| Sum | 383066 |
| Variance | 9.2309974 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 7550 | |
| 9 | 6555 | |
| 7 | 6018 | |
| 10 | 5857 | |
| 5 | 5157 | |
| 4 | 5034 | |
| 6 | 4615 | |
| 3 | 4365 | |
| 2 | 3705 | |
| 12 | 2996 | 5.2% |
| Other values (2) | 5407 |
| Value | Count | Frequency (%) |
| 1 | 2657 | 4.6% |
| 2 | 3705 | |
| 3 | 4365 | |
| 4 | 5034 | |
| 5 | 5157 | |
| 6 | 4615 | |
| 7 | 6018 | |
| 8 | 7550 | |
| 9 | 6555 | |
| 10 | 5857 |
| Value | Count | Frequency (%) |
| 12 | 2996 | 5.2% |
| 11 | 2750 | 4.8% |
| 10 | 5857 | |
| 9 | 6555 | |
| 8 | 7550 | |
| 7 | 6018 | |
| 6 | 4615 | |
| 5 | 5157 | |
| 4 | 5034 | |
| 3 | 4365 |
arrival_date_week_number
Real number (ℝ)
High correlation 
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.834611 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 17 |
| median | 29 |
| Q3 | 38 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 13.311418 |
|---|---|
| Coefficient of variation (CV) | 0.47823259 |
| Kurtosis | -0.93923648 |
| Mean | 27.834611 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.13420212 |
| Sum | 1593782 |
| Variance | 177.19386 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 1977 | 3.5% |
| 34 | 1674 | 2.9% |
| 38 | 1629 | 2.8% |
| 41 | 1628 | 2.8% |
| 32 | 1596 | 2.8% |
| 42 | 1565 | 2.7% |
| 37 | 1522 | 2.7% |
| 40 | 1485 | 2.6% |
| 35 | 1466 | 2.6% |
| 30 | 1454 | 2.5% |
| Other values (43) | 41263 |
| Value | Count | Frequency (%) |
| 1 | 394 | 0.7% |
| 2 | 565 | |
| 3 | 640 | |
| 4 | 653 | |
| 5 | 571 | |
| 6 | 762 | |
| 7 | 1035 | |
| 8 | 854 | |
| 9 | 925 | |
| 10 | 974 |
| Value | Count | Frequency (%) |
| 53 | 780 | |
| 52 | 632 | |
| 51 | 437 | |
| 50 | 489 | |
| 49 | 811 | |
| 48 | 686 | |
| 47 | 786 | |
| 46 | 524 | |
| 45 | 722 | |
| 44 | 962 |
arrival_date_day_of_month
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.770307 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.7834858 |
|---|---|
| Coefficient of variation (CV) | 0.55696353 |
| Kurtosis | -1.176783 |
| Mean | 15.770307 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.020637647 |
| Sum | 902992 |
| Variance | 77.149622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 2184 | 3.8% |
| 12 | 2174 | 3.8% |
| 16 | 2151 | 3.8% |
| 17 | 2063 | 3.6% |
| 18 | 2057 | 3.6% |
| 30 | 2051 | 3.6% |
| 26 | 2048 | 3.6% |
| 9 | 2034 | 3.6% |
| 25 | 1968 | 3.4% |
| 15 | 1953 | 3.4% |
| Other values (21) | 36576 |
| Value | Count | Frequency (%) |
| 1 | 1705 | |
| 2 | 1943 | |
| 3 | 1790 | |
| 4 | 1791 | |
| 5 | 2184 | |
| 6 | 1728 | |
| 7 | 1808 | |
| 8 | 1866 | |
| 9 | 2034 | |
| 10 | 1664 |
| Value | Count | Frequency (%) |
| 31 | 1156 | |
| 30 | 2051 | |
| 29 | 1671 | |
| 28 | 1757 | |
| 27 | 1664 | |
| 26 | 2048 | |
| 25 | 1968 | |
| 24 | 1919 | |
| 23 | 1732 | |
| 22 | 1762 |
stays_in_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0619466 |
| Minimum | 0 |
|---|---|
| Maximum | 16 |
| Zeros | 22722 |
| Zeros (%) | 39.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.089239 |
|---|---|
| Coefficient of variation (CV) | 1.0257004 |
| Kurtosis | 5.4917112 |
| Mean | 1.0619466 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.317999 |
| Sum | 60806 |
| Variance | 1.1864416 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 22722 | |
| 2 | 18024 | |
| 1 | 13659 | |
| 4 | 1599 | 2.8% |
| 3 | 998 | 1.7% |
| 6 | 125 | 0.2% |
| 5 | 50 | 0.1% |
| 8 | 42 | 0.1% |
| 7 | 17 | < 0.1% |
| 9 | 8 | < 0.1% |
| Other values (5) | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 22722 | |
| 1 | 13659 | |
| 2 | 18024 | |
| 3 | 998 | 1.7% |
| 4 | 1599 | 2.8% |
| 5 | 50 | 0.1% |
| 6 | 125 | 0.2% |
| 7 | 17 | < 0.1% |
| 8 | 42 | 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 16 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 5 | < 0.1% |
| 10 | 5 | < 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 42 | 0.1% |
| 7 | 17 | < 0.1% |
| 6 | 125 | |
| 5 | 50 | 0.1% |
stays_in_week_nights
Real number (ℝ)
Zeros 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8528441 |
| Minimum | 0 |
|---|---|
| Maximum | 40 |
| Zeros | 3486 |
| Zeros (%) | 6.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 40 |
| Range | 40 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.2255226 |
|---|---|
| Coefficient of variation (CV) | 0.78010664 |
| Kurtosis | 14.450583 |
| Mean | 2.8528441 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.3788334 |
| Sum | 163351 |
| Variance | 4.9529509 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 14152 | |
| 1 | 13164 | |
| 3 | 9420 | |
| 5 | 8324 | |
| 4 | 4748 | 8.3% |
| 0 | 3486 | 6.1% |
| 6 | 1188 | 2.1% |
| 10 | 914 | 1.6% |
| 7 | 864 | 1.5% |
| 8 | 515 | 0.9% |
| Other values (21) | 484 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 3486 | 6.1% |
| 1 | 13164 | |
| 2 | 14152 | |
| 3 | 9420 | |
| 4 | 4748 | 8.3% |
| 5 | 8324 | |
| 6 | 1188 | 2.1% |
| 7 | 864 | 1.5% |
| 8 | 515 | 0.9% |
| 9 | 176 | 0.3% |
| Value | Count | Frequency (%) |
| 40 | 2 | < 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 4 | < 0.1% |
| 26 | 1 | < 0.1% |
| 25 | 5 | |
| 24 | 1 | < 0.1% |
| 22 | 2 | < 0.1% |
| 21 | 11 |
adults
Real number (ℝ)
Skewed 
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9731221 |
| Minimum | -1 |
|---|---|
| Maximum | 100 |
| Zeros | 98 |
| Zeros (%) | 0.2% |
| Negative | 92 |
| Negative (%) | 0.2% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 100 |
| Range | 101 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.9399952 |
|---|---|
| Coefficient of variation (CV) | 1.4900219 |
| Kurtosis | 661.32331 |
| Mean | 1.9731221 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.874327 |
| Sum | 112979 |
| Variance | 8.6435718 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 44642 | |
| 1 | 10109 | 17.7% |
| 3 | 2176 | 3.8% |
| 0 | 98 | 0.2% |
| -1 | 92 | 0.2% |
| 4 | 34 | 0.1% |
| 66 | 6 | < 0.1% |
| 26 | 5 | < 0.1% |
| 65 | 5 | < 0.1% |
| 53 | 4 | < 0.1% |
| Other values (44) | 88 | 0.2% |
| Value | Count | Frequency (%) |
| -1 | 92 | 0.2% |
| 0 | 98 | 0.2% |
| 1 | 10109 | 17.7% |
| 2 | 44642 | |
| 3 | 2176 | 3.8% |
| 4 | 34 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 3 | |
| 98 | 2 | |
| 96 | 2 | |
| 95 | 3 | |
| 93 | 1 | < 0.1% |
| 92 | 2 | |
| 91 | 4 | |
| 89 | 1 | < 0.1% |
| 87 | 1 | < 0.1% |
| 86 | 2 |
children
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| 0 | |
|---|---|
| 1 | 2281 |
| 2 | 2069 |
| 3 | 26 |
| 10 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.0000175 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 52882 | |
| 1 | 2281 | 4.0% |
| 2 | 2069 | 3.6% |
| 3 | 26 | < 0.1% |
| 10 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 52882 | |
| 1 | 2281 | 4.0% |
| 2 | 2069 | 3.6% |
| 3 | 26 | < 0.1% |
| 10 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 52883 | |
| 1 | 2282 | 4.0% |
| 2 | 2069 | 3.6% |
| 3 | 26 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 57260 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52883 | |
| 1 | 2282 | 4.0% |
| 2 | 2069 | 3.6% |
| 3 | 26 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 57260 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 52883 | |
| 1 | 2282 | 4.0% |
| 2 | 2069 | 3.6% |
| 3 | 26 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57260 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 52883 | |
| 1 | 2282 | 4.0% |
| 2 | 2069 | 3.6% |
| 3 | 26 | < 0.1% |
babies
Real number (ℝ)
Skewed  Zeros 
| Distinct | 47 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.13699156 |
| Minimum | -1 |
|---|---|
| Maximum | 100 |
| Zeros | 56477 |
| Zeros (%) | 98.6% |
| Negative | 90 |
| Negative (%) | 0.2% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 100 |
| Range | 101 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3.1286788 |
|---|---|
| Coefficient of variation (CV) | 22.838478 |
| Kurtosis | 665.52972 |
| Mean | 0.13699156 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 25.358589 |
| Sum | 7844 |
| Variance | 9.7886311 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56477 | |
| 1 | 584 | 1.0% |
| -1 | 90 | 0.2% |
| 2 | 7 | < 0.1% |
| 57 | 5 | < 0.1% |
| 51 | 5 | < 0.1% |
| 73 | 5 | < 0.1% |
| 77 | 5 | < 0.1% |
| 94 | 4 | < 0.1% |
| 97 | 4 | < 0.1% |
| Other values (37) | 73 | 0.1% |
| Value | Count | Frequency (%) |
| -1 | 90 | 0.2% |
| 0 | 56477 | |
| 1 | 584 | 1.0% |
| 2 | 7 | < 0.1% |
| 10 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 51 | 5 | < 0.1% |
| 52 | 2 | < 0.1% |
| 53 | 2 | < 0.1% |
| 54 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 2 | |
| 99 | 2 | |
| 98 | 2 | |
| 97 | 4 | |
| 96 | 2 | |
| 95 | 1 | < 0.1% |
| 94 | 4 | |
| 93 | 3 | |
| 92 | 3 | |
| 91 | 1 | < 0.1% |
meal
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| BB | |
|---|---|
| HB | |
| SC | 1747 |
| Undefined | 1147 |
| FB | 780 |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.1402225 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BB |
|---|---|
| 2nd row | BB |
| 3rd row | BB |
| 4th row | BB |
| 5th row | BB |
Common Values
| Value | Count | Frequency (%) |
| BB | 43715 | |
| HB | 9870 | 17.2% |
| SC | 1747 | 3.1% |
| Undefined | 1147 | 2.0% |
| FB | 780 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bb | 43715 | |
| hb | 9870 | 17.2% |
| sc | 1747 | 3.1% |
| undefined | 1147 | 2.0% |
| fb | 780 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 98080 | |
| H | 9870 | 8.1% |
| d | 2294 | 1.9% |
| e | 2294 | 1.9% |
| n | 2294 | 1.9% |
| S | 1747 | 1.4% |
| C | 1747 | 1.4% |
| U | 1147 | 0.9% |
| f | 1147 | 0.9% |
| i | 1147 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 113371 | |
| Lowercase Letter | 9176 | 7.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 98080 | |
| H | 9870 | 8.7% |
| S | 1747 | 1.5% |
| C | 1747 | 1.5% |
| U | 1147 | 1.0% |
| F | 780 | 0.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2294 | |
| e | 2294 | |
| n | 2294 | |
| f | 1147 | |
| i | 1147 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 122547 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 98080 | |
| H | 9870 | 8.1% |
| d | 2294 | 1.9% |
| e | 2294 | 1.9% |
| n | 2294 | 1.9% |
| S | 1747 | 1.4% |
| C | 1747 | 1.4% |
| U | 1147 | 0.9% |
| f | 1147 | 0.9% |
| i | 1147 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 122547 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 98080 | |
| H | 9870 | 8.1% |
| d | 2294 | 1.9% |
| e | 2294 | 1.9% |
| n | 2294 | 1.9% |
| S | 1747 | 1.4% |
| C | 1747 | 1.4% |
| U | 1147 | 0.9% |
| f | 1147 | 0.9% |
| i | 1147 | 0.9% |
country
Text
| Distinct | 141 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.9863078 |
| Min length | 2 |
Unique
| Unique | 28 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PRT |
|---|---|
| 2nd row | PRT |
| 3rd row | GBR |
| 4th row | GBR |
| 5th row | GBR |
| Value | Count | Frequency (%) |
| prt | 27023 | |
| gbr | 7424 | 13.0% |
| esp | 5177 | 9.0% |
| fra | 2980 | 5.2% |
| irl | 2325 | 4.1% |
| deu | 1981 | 3.5% |
| ita | 1261 | 2.2% |
| cn | 784 | 1.4% |
| nld | 732 | 1.3% |
| bel | 721 | 1.3% |
| Other values (131) | 6851 | 12.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 41955 | |
| P | 32772 | |
| T | 28965 | |
| E | 9124 | 5.3% |
| B | 8954 | 5.2% |
| G | 7810 | 4.6% |
| S | 7081 | 4.1% |
| A | 6771 | 4.0% |
| L | 4605 | 2.7% |
| U | 4184 | 2.4% |
| Other values (16) | 18772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 170993 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 41955 | |
| P | 32772 | |
| T | 28965 | |
| E | 9124 | 5.3% |
| B | 8954 | 5.2% |
| G | 7810 | 4.6% |
| S | 7081 | 4.1% |
| A | 6771 | 4.0% |
| L | 4605 | 2.7% |
| U | 4184 | 2.4% |
| Other values (16) | 18772 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 170993 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 41955 | |
| P | 32772 | |
| T | 28965 | |
| E | 9124 | 5.3% |
| B | 8954 | 5.2% |
| G | 7810 | 4.6% |
| S | 7081 | 4.1% |
| A | 6771 | 4.0% |
| L | 4605 | 2.7% |
| U | 4184 | 2.4% |
| Other values (16) | 18772 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 170993 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 41955 | |
| P | 32772 | |
| T | 28965 | |
| E | 9124 | 5.3% |
| B | 8954 | 5.2% |
| G | 7810 | 4.6% |
| S | 7081 | 4.1% |
| A | 6771 | 4.0% |
| L | 4605 | 2.7% |
| U | 4184 | 2.4% |
| Other values (16) | 18772 |
market_segment
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| Online TA | |
|---|---|
| Offline TA/TO | |
| Groups | |
| Direct | |
| Corporate | 2381 |
| Other values (3) | 268 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 8.9604429 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Direct |
| 3rd row | Direct |
| 4th row | Corporate |
| 5th row | Online TA |
Common Values
| Value | Count | Frequency (%) |
| Online TA | 25190 | |
| Offline TA/TO | 12148 | |
| Groups | 10179 | |
| Direct | 7093 | 12.4% |
| Corporate | 2381 | 4.2% |
| Complementary | 245 | 0.4% |
| Aviation | 21 | < 0.1% |
| Undefined | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 25190 | |
| ta | 25190 | |
| offline | 12148 | |
| ta/to | 12148 | |
| groups | 10179 | |
| direct | 7093 | 7.5% |
| corporate | 2381 | 2.5% |
| complementary | 245 | 0.3% |
| aviation | 21 | < 0.1% |
| undefined | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 62798 | |
| O | 49486 | |
| T | 49486 | |
| e | 47306 | |
| i | 44475 | |
| l | 37583 | 7.3% |
| A | 37359 | 7.3% |
| 37338 | 7.3% | |
| f | 24298 | 4.7% |
| r | 22279 | 4.3% |
| Other values (16) | 100658 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 307349 | |
| Uppercase Letter | 156231 | |
| Space Separator | 37338 | 7.3% |
| Other Punctuation | 12148 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 62798 | |
| e | 47306 | |
| i | 44475 | |
| l | 37583 | |
| f | 24298 | 7.9% |
| r | 22279 | 7.2% |
| o | 15207 | 4.9% |
| p | 12805 | 4.2% |
| u | 10179 | 3.3% |
| s | 10179 | 3.3% |
| Other values (7) | 20240 | 6.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 49486 | |
| T | 49486 | |
| A | 37359 | |
| G | 10179 | 6.5% |
| D | 7093 | 4.5% |
| C | 2626 | 1.7% |
| U | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 37338 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 12148 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 463580 | |
| Common | 49486 | 9.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 62798 | |
| O | 49486 | |
| T | 49486 | |
| e | 47306 | |
| i | 44475 | |
| l | 37583 | |
| A | 37359 | |
| f | 24298 | 5.2% |
| r | 22279 | 4.8% |
| o | 15207 | 3.3% |
| Other values (14) | 73303 |
Common
| Value | Count | Frequency (%) |
| 37338 | ||
| / | 12148 | 24.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 513066 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 62798 | |
| O | 49486 | |
| T | 49486 | |
| e | 47306 | |
| i | 44475 | |
| l | 37583 | 7.3% |
| A | 37359 | 7.3% |
| 37338 | 7.3% | |
| f | 24298 | 4.7% |
| r | 22279 | 4.3% |
| Other values (16) | 100658 |
distribution_channel
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| TA/TO | |
|---|---|
| Direct | |
| Corporate | 3412 |
| GDS | 11 |
| Undefined | 5 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.3868388 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Direct |
| 3rd row | Direct |
| 4th row | Corporate |
| 5th row | TA/TO |
Common Values
| Value | Count | Frequency (%) |
| TA/TO | 45327 | |
| Direct | 8504 | 14.9% |
| Corporate | 3412 | 6.0% |
| GDS | 11 | < 0.1% |
| Undefined | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ta/to | 45327 | |
| direct | 8504 | 14.9% |
| corporate | 3412 | 6.0% |
| gds | 11 | < 0.1% |
| undefined | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 90654 | |
| A | 45327 | |
| / | 45327 | |
| O | 45327 | |
| r | 15328 | 5.0% |
| e | 11926 | 3.9% |
| t | 11916 | 3.9% |
| D | 8515 | 2.8% |
| i | 8509 | 2.8% |
| c | 8504 | 2.8% |
| Other values (10) | 17112 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 193262 | |
| Lowercase Letter | 69856 | 22.6% |
| Other Punctuation | 45327 | 14.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 15328 | |
| e | 11926 | |
| t | 11916 | |
| i | 8509 | |
| c | 8504 | |
| o | 6824 | |
| p | 3412 | 4.9% |
| a | 3412 | 4.9% |
| n | 10 | < 0.1% |
| d | 10 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 90654 | |
| A | 45327 | |
| O | 45327 | |
| D | 8515 | 4.4% |
| C | 3412 | 1.8% |
| G | 11 | < 0.1% |
| S | 11 | < 0.1% |
| U | 5 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 45327 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 263118 | |
| Common | 45327 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 90654 | |
| A | 45327 | |
| O | 45327 | |
| r | 15328 | 5.8% |
| e | 11926 | 4.5% |
| t | 11916 | 4.5% |
| D | 8515 | 3.2% |
| i | 8509 | 3.2% |
| c | 8504 | 3.2% |
| o | 6824 | 2.6% |
| Other values (9) | 10288 | 3.9% |
Common
| Value | Count | Frequency (%) |
| / | 45327 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 308445 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 90654 | |
| A | 45327 | |
| / | 45327 | |
| O | 45327 | |
| r | 15328 | 5.0% |
| e | 11926 | 3.9% |
| t | 11916 | 3.9% |
| D | 8515 | 2.8% |
| i | 8509 | 2.8% |
| c | 8504 | 2.8% |
| Other values (10) | 17112 | 5.5% |
is_repeated_guest
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| 0 | |
|---|---|
| 1 | 1734 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 55525 | |
| 1 | 1734 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 55525 | |
| 1 | 1734 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 55525 | |
| 1 | 1734 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 57259 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 55525 | |
| 1 | 1734 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 57259 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 55525 | |
| 1 | 1734 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57259 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 55525 | |
| 1 | 1734 | 3.0% |
previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06945633 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 56224 |
| Zeros (%) | 98.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1127908 |
|---|---|
| Coefficient of variation (CV) | 16.021445 |
| Kurtosis | 452.18405 |
| Mean | 0.06945633 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.011265 |
| Sum | 3977 |
| Variance | 1.2383033 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56224 | |
| 1 | 843 | 1.5% |
| 24 | 48 | 0.1% |
| 2 | 40 | 0.1% |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 19 | 18 | < 0.1% |
| 3 | 14 | < 0.1% |
| 14 | 13 | < 0.1% |
| 4 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 56224 | |
| 1 | 843 | 1.5% |
| 2 | 40 | 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 3 | < 0.1% |
| 14 | 13 | < 0.1% |
| 19 | 18 | < 0.1% |
| 24 | 48 | 0.1% |
| 25 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 24 | 48 | 0.1% |
| 19 | 18 | < 0.1% |
| 14 | 13 | < 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 5 | < 0.1% |
| 3 | 14 | < 0.1% |
| 2 | 40 | 0.1% |
| 1 | 843 |
previous_bookings_not_canceled
Real number (ℝ)
Zeros 
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08737491 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 55492 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.76767839 |
|---|---|
| Coefficient of variation (CV) | 8.7860278 |
| Kurtosis | 407.53657 |
| Mean | 0.08737491 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.018519 |
| Sum | 5003 |
| Variance | 0.58933011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 55492 | |
| 1 | 864 | 1.5% |
| 2 | 336 | 0.6% |
| 3 | 169 | 0.3% |
| 4 | 107 | 0.2% |
| 5 | 78 | 0.1% |
| 6 | 50 | 0.1% |
| 7 | 31 | 0.1% |
| 8 | 30 | 0.1% |
| 9 | 19 | < 0.1% |
| Other values (20) | 83 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 55492 | |
| 1 | 864 | 1.5% |
| 2 | 336 | 0.6% |
| 3 | 169 | 0.3% |
| 4 | 107 | 0.2% |
| 5 | 78 | 0.1% |
| 6 | 50 | 0.1% |
| 7 | 31 | 0.1% |
| 8 | 30 | 0.1% |
| 9 | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 2 | |
| 26 | 1 | < 0.1% |
| 25 | 3 | |
| 24 | 2 | |
| 23 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 21 | 2 | |
| 20 | 1 | < 0.1% |
reserved_room_type
Categorical
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| A | |
|---|---|
| D | |
| E | |
| G | 1601 |
| F | 1477 |
| Other values (4) | 1884 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 37981 | |
| D | 9312 | 16.3% |
| E | 5004 | 8.7% |
| G | 1601 | 2.8% |
| F | 1477 | 2.6% |
| C | 898 | 1.6% |
| H | 589 | 1.0% |
| B | 392 | 0.7% |
| L | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 37981 | |
| d | 9312 | 16.3% |
| e | 5004 | 8.7% |
| g | 1601 | 2.8% |
| f | 1477 | 2.6% |
| c | 898 | 1.6% |
| h | 589 | 1.0% |
| b | 392 | 0.7% |
| l | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 37981 | |
| D | 9312 | 16.3% |
| E | 5004 | 8.7% |
| G | 1601 | 2.8% |
| F | 1477 | 2.6% |
| C | 898 | 1.6% |
| H | 589 | 1.0% |
| B | 392 | 0.7% |
| L | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 57259 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 37981 | |
| D | 9312 | 16.3% |
| E | 5004 | 8.7% |
| G | 1601 | 2.8% |
| F | 1477 | 2.6% |
| C | 898 | 1.6% |
| H | 589 | 1.0% |
| B | 392 | 0.7% |
| L | 5 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57259 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 37981 | |
| D | 9312 | 16.3% |
| E | 5004 | 8.7% |
| G | 1601 | 2.8% |
| F | 1477 | 2.6% |
| C | 898 | 1.6% |
| H | 589 | 1.0% |
| B | 392 | 0.7% |
| L | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57259 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 37981 | |
| D | 9312 | 16.3% |
| E | 5004 | 8.7% |
| G | 1601 | 2.8% |
| F | 1477 | 2.6% |
| C | 898 | 1.6% |
| H | 589 | 1.0% |
| B | 392 | 0.7% |
| L | 5 | < 0.1% |
assigned_room_type
Categorical
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| A | |
|---|---|
| D | |
| E | |
| C | 2155 |
| F | 2114 |
| Other values (5) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 30596 | |
| D | 12924 | |
| E | 5739 | 10.0% |
| C | 2155 | 3.8% |
| F | 2114 | 3.7% |
| G | 1863 | 3.3% |
| B | 805 | 1.4% |
| H | 693 | 1.2% |
| I | 348 | 0.6% |
| K | 22 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 30596 | |
| d | 12924 | |
| e | 5739 | 10.0% |
| c | 2155 | 3.8% |
| f | 2114 | 3.7% |
| g | 1863 | 3.3% |
| b | 805 | 1.4% |
| h | 693 | 1.2% |
| i | 348 | 0.6% |
| k | 22 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 30596 | |
| D | 12924 | |
| E | 5739 | 10.0% |
| C | 2155 | 3.8% |
| F | 2114 | 3.7% |
| G | 1863 | 3.3% |
| B | 805 | 1.4% |
| H | 693 | 1.2% |
| I | 348 | 0.6% |
| K | 22 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 57259 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 30596 | |
| D | 12924 | |
| E | 5739 | 10.0% |
| C | 2155 | 3.8% |
| F | 2114 | 3.7% |
| G | 1863 | 3.3% |
| B | 805 | 1.4% |
| H | 693 | 1.2% |
| I | 348 | 0.6% |
| K | 22 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57259 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 30596 | |
| D | 12924 | |
| E | 5739 | 10.0% |
| C | 2155 | 3.8% |
| F | 2114 | 3.7% |
| G | 1863 | 3.3% |
| B | 805 | 1.4% |
| H | 693 | 1.2% |
| I | 348 | 0.6% |
| K | 22 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 57259 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 30596 | |
| D | 12924 | |
| E | 5739 | 10.0% |
| C | 2155 | 3.8% |
| F | 2114 | 3.7% |
| G | 1863 | 3.3% |
| B | 805 | 1.4% |
| H | 693 | 1.2% |
| I | 348 | 0.6% |
| K | 22 | < 0.1% |
booking_changes
Real number (ℝ)
Zeros 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.24287885 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 47909 |
| Zeros (%) | 83.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.69558072 |
|---|---|
| Coefficient of variation (CV) | 2.8639 |
| Kurtosis | 69.801833 |
| Mean | 0.24287885 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.7791198 |
| Sum | 13907 |
| Variance | 0.48383254 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 47909 | |
| 1 | 6555 | 11.4% |
| 2 | 1872 | 3.3% |
| 3 | 537 | 0.9% |
| 4 | 214 | 0.4% |
| 5 | 76 | 0.1% |
| 6 | 42 | 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (8) | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 47909 | |
| 1 | 6555 | 11.4% |
| 2 | 1872 | 3.3% |
| 3 | 537 | 0.9% |
| 4 | 214 | 0.4% |
| 5 | 76 | 0.1% |
| 6 | 42 | 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 5 | |
| 12 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 9 | 7 | |
| 8 | 11 |
deposit_type
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| No Deposit | |
|---|---|
| Non Refund | |
| No Refund | 941 |
| Refundable | 141 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.9835659 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
| No Deposit | 50844 | |
| Non Refund | 5333 | 9.3% |
| No Refund | 941 | 1.6% |
| Refundable | 141 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 51785 | |
| deposit | 50844 | |
| refund | 6274 | 5.5% |
| non | 5333 | 4.7% |
| refundable | 141 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 107962 | |
| e | 57400 | |
| N | 57118 | |
| 57118 | ||
| D | 50844 | |
| p | 50844 | |
| s | 50844 | |
| i | 50844 | |
| t | 50844 | |
| n | 11748 | 2.1% |
| Other values (7) | 26083 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 400154 | |
| Uppercase Letter | 114377 | 20.0% |
| Space Separator | 57118 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 107962 | |
| e | 57400 | |
| p | 50844 | |
| s | 50844 | |
| i | 50844 | |
| t | 50844 | |
| n | 11748 | 2.9% |
| f | 6415 | 1.6% |
| u | 6415 | 1.6% |
| d | 6415 | 1.6% |
| Other values (3) | 423 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 57118 | |
| D | 50844 | |
| R | 6415 | 5.6% |
Space Separator
| Value | Count | Frequency (%) |
| 57118 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 514531 | |
| Common | 57118 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 107962 | |
| e | 57400 | |
| N | 57118 | |
| D | 50844 | |
| p | 50844 | |
| s | 50844 | |
| i | 50844 | |
| t | 50844 | |
| n | 11748 | 2.3% |
| R | 6415 | 1.2% |
| Other values (6) | 19668 | 3.8% |
Common
| Value | Count | Frequency (%) |
| 57118 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 571649 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 107962 | |
| e | 57400 | |
| N | 57118 | |
| 57118 | ||
| D | 50844 | |
| p | 50844 | |
| s | 50844 | |
| i | 50844 | |
| t | 50844 | |
| n | 11748 | 2.1% |
| Other values (7) | 26083 | 4.6% |
agent
Real number (ℝ)
High correlation  Zeros 
| Distinct | 249 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 124.64032 |
| Minimum | 0 |
|---|---|
| Maximum | 535 |
| Zeros | 8618 |
| Zeros (%) | 15.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 75 |
| Q3 | 240 |
| 95-th percentile | 298 |
| Maximum | 535 |
| Range | 535 |
| Interquartile range (IQR) | 233 |
Descriptive statistics
| Standard deviation | 122.52477 |
|---|---|
| Coefficient of variation (CV) | 0.98302677 |
| Kurtosis | -1.1083975 |
| Mean | 124.64032 |
| Median Absolute Deviation (MAD) | 75 |
| Skewness | 0.38339404 |
| Sum | 7136780 |
| Variance | 15012.319 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 240 | 13579 | |
| 0 | 8618 | |
| 9 | 6891 | |
| 1 | 3119 | 5.4% |
| 250 | 2791 | 4.9% |
| 241 | 1677 | 2.9% |
| 6 | 1349 | 2.4% |
| 40 | 990 | 1.7% |
| 314 | 904 | 1.6% |
| 242 | 762 | 1.3% |
| Other values (239) | 16579 |
| Value | Count | Frequency (%) |
| 0 | 8618 | |
| 1 | 3119 | 5.4% |
| 2 | 117 | 0.2% |
| 3 | 545 | 1.0% |
| 5 | 248 | 0.4% |
| 6 | 1349 | 2.4% |
| 7 | 476 | 0.8% |
| 8 | 549 | 1.0% |
| 9 | 6891 | |
| 10 | 38 | 0.1% |
| Value | Count | Frequency (%) |
| 535 | 3 | < 0.1% |
| 531 | 65 | |
| 527 | 35 | |
| 526 | 10 | < 0.1% |
| 510 | 2 | < 0.1% |
| 508 | 6 | < 0.1% |
| 502 | 23 | < 0.1% |
| 497 | 1 | < 0.1% |
| 495 | 50 | |
| 493 | 34 |
company
Unsupported
Rejected  Unsupported 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
days_in_waiting_list
Real number (ℝ)
Zeros 
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5441939 |
| Minimum | 0 |
|---|---|
| Maximum | 391 |
| Zeros | 54921 |
| Zeros (%) | 95.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 391 |
| Range | 391 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 21.887563 |
|---|---|
| Coefficient of variation (CV) | 6.1756109 |
| Kurtosis | 100.35843 |
| Mean | 3.5441939 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.8782447 |
| Sum | 202937 |
| Variance | 479.0654 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 54921 | |
| 39 | 180 | 0.3% |
| 58 | 163 | 0.3% |
| 31 | 98 | 0.2% |
| 69 | 89 | 0.2% |
| 87 | 78 | 0.1% |
| 63 | 77 | 0.1% |
| 111 | 68 | 0.1% |
| 101 | 63 | 0.1% |
| 77 | 62 | 0.1% |
| Other values (89) | 1460 | 2.5% |
| Value | Count | Frequency (%) |
| 0 | 54921 | |
| 1 | 7 | < 0.1% |
| 2 | 2 | < 0.1% |
| 3 | 59 | 0.1% |
| 4 | 10 | < 0.1% |
| 5 | 4 | < 0.1% |
| 6 | 4 | < 0.1% |
| 8 | 6 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 391 | 14 | < 0.1% |
| 379 | 15 | < 0.1% |
| 330 | 14 | < 0.1% |
| 259 | 10 | < 0.1% |
| 236 | 35 | |
| 224 | 10 | < 0.1% |
| 223 | 59 | |
| 215 | 21 | < 0.1% |
| 207 | 15 | < 0.1% |
| 187 | 45 |
customer_type
Categorical
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 575 |
| Missing (%) | 1.0% |
| Memory size | 894.7 KiB |
| Transient | |
|---|---|
| Transient-Party | |
| Contract | 2442 |
| Group | 300 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.28703 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient |
| 3rd row | Transient |
| 4th row | Transient |
| 5th row | Transient |
Common Values
| Value | Count | Frequency (%) |
| Transient | 41176 | |
| Transient-Party | 12766 | 22.3% |
| Contract | 2442 | 4.3% |
| Group | 300 | 0.5% |
| (Missing) | 575 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| transient | 41176 | |
| transient-party | 12766 | 22.5% |
| contract | 2442 | 4.3% |
| group | 300 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 110326 | |
| t | 71592 | |
| r | 69450 | |
| a | 69150 | |
| T | 53942 | |
| s | 53942 | |
| i | 53942 | |
| e | 53942 | |
| - | 12766 | 2.2% |
| P | 12766 | 2.2% |
| Other values (7) | 21292 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 500894 | |
| Uppercase Letter | 69450 | 11.9% |
| Dash Punctuation | 12766 | 2.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 110326 | |
| t | 71592 | |
| r | 69450 | |
| a | 69150 | |
| s | 53942 | |
| i | 53942 | |
| e | 53942 | |
| y | 12766 | 2.5% |
| o | 2742 | 0.5% |
| c | 2442 | 0.5% |
| Other values (2) | 600 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 53942 | |
| P | 12766 | 18.4% |
| C | 2442 | 3.5% |
| G | 300 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12766 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 570344 | |
| Common | 12766 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 110326 | |
| t | 71592 | |
| r | 69450 | |
| a | 69150 | |
| T | 53942 | |
| s | 53942 | |
| i | 53942 | |
| e | 53942 | |
| P | 12766 | 2.2% |
| y | 12766 | 2.2% |
| Other values (6) | 8526 | 1.5% |
Common
| Value | Count | Frequency (%) |
| - | 12766 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 583110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 110326 | |
| t | 71592 | |
| r | 69450 | |
| a | 69150 | |
| T | 53942 | |
| s | 53942 | |
| i | 53942 | |
| e | 53942 | |
| - | 12766 | 2.2% |
| P | 12766 | 2.2% |
| Other values (7) | 21292 | 3.7% |
adr
Real number (ℝ)
Zeros 
| Distinct | 6680 |
|---|---|
| Distinct (%) | 11.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.599672 |
| Minimum | -6.38 |
|---|---|
| Maximum | 5400 |
| Zeros | 917 |
| Zeros (%) | 1.6% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | -6.38 |
|---|---|
| 5-th percentile | 34.427 |
| Q1 | 60 |
| median | 84.7 |
| Q3 | 120.44 |
| 95-th percentile | 207.9 |
| Maximum | 5400 |
| Range | 5406.38 |
| Interquartile range (IQR) | 60.44 |
Descriptive statistics
| Standard deviation | 58.668984 |
|---|---|
| Coefficient of variation (CV) | 0.60734144 |
| Kurtosis | 1166.914 |
| Mean | 96.599672 |
| Median Absolute Deviation (MAD) | 29.3 |
| Skewness | 13.877957 |
| Sum | 5531200.6 |
| Variance | 3442.0497 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 1756 | 3.1% |
| 75 | 1055 | 1.8% |
| 48 | 991 | 1.7% |
| 0 | 917 | 1.6% |
| 65 | 902 | 1.6% |
| 60 | 779 | 1.4% |
| 90 | 714 | 1.2% |
| 120 | 688 | 1.2% |
| 80 | 680 | 1.2% |
| 70 | 647 | 1.1% |
| Other values (6670) | 48130 |
| Value | Count | Frequency (%) |
| -6.38 | 1 | < 0.1% |
| 0 | 917 | |
| 0.26 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 1 | 3 | < 0.1% |
| 1.56 | 2 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| 2 | 8 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| 4 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 5400 | 1 | |
| 508 | 1 | |
| 450 | 1 | |
| 437 | 1 | |
| 426.25 | 1 | |
| 402 | 1 | |
| 397.38 | 1 | |
| 392 | 2 | |
| 388 | 2 | |
| 387 | 1 |
required_car_parking_spaces
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 570 |
| Missing (%) | 1.0% |
| Memory size | 894.7 KiB |
| 0.0 | |
|---|---|
| 1.0 | |
| 2.0 | 24 |
| 8.0 | 2 |
| 3.0 | 1 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 51303 | |
| 1.0 | 5359 | 9.4% |
| 2.0 | 24 | < 0.1% |
| 8.0 | 2 | < 0.1% |
| 3.0 | 1 | < 0.1% |
| (Missing) | 570 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 51303 | |
| 1.0 | 5359 | 9.5% |
| 2.0 | 24 | < 0.1% |
| 8.0 | 2 | < 0.1% |
| 3.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 107992 | |
| . | 56689 | |
| 1 | 5359 | 3.2% |
| 2 | 24 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 113378 | |
| Other Punctuation | 56689 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 107992 | |
| 1 | 5359 | 4.7% |
| 2 | 24 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 56689 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 170067 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 107992 | |
| . | 56689 | |
| 1 | 5359 | 3.2% |
| 2 | 24 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 170067 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 107992 | |
| . | 56689 | |
| 1 | 5359 | 3.2% |
| 2 | 24 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
total_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.51258317 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 36106 |
| Zeros (%) | 63.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 894.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.76801752 |
|---|---|
| Coefficient of variation (CV) | 1.4983276 |
| Kurtosis | 1.8446116 |
| Mean | 0.51258317 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4828921 |
| Sum | 29350 |
| Variance | 0.58985092 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36106 | |
| 1 | 14315 | 25.0% |
| 2 | 5646 | 9.9% |
| 3 | 1036 | 1.8% |
| 4 | 145 | 0.3% |
| 5 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 36106 | |
| 1 | 14315 | 25.0% |
| 2 | 5646 | 9.9% |
| 3 | 1036 | 1.8% |
| 4 | 145 | 0.3% |
| 5 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 11 | < 0.1% |
| 4 | 145 | 0.3% |
| 3 | 1036 | 1.8% |
| 2 | 5646 | 9.9% |
| 1 | 14315 | 25.0% |
| 0 | 36106 |
reservation_status
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| Check-Out | |
|---|---|
| Canceled | |
| No-Show | 788 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.5719101 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Check-Out |
|---|---|
| 2nd row | Check-Out |
| 3rd row | Check-Out |
| 4th row | Check-Out |
| 5th row | Check-Out |
Common Values
| Value | Count | Frequency (%) |
| Check-Out | 33535 | |
| Canceled | 22936 | |
| No-Show | 788 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| check-out | 33535 | |
| canceled | 22936 | |
| no-show | 788 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 79407 | |
| C | 56471 | |
| c | 56471 | |
| h | 34323 | |
| - | 34323 | |
| k | 33535 | |
| O | 33535 | |
| u | 33535 | |
| t | 33535 | |
| a | 22936 | 4.7% |
| Other values (7) | 72748 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 364914 | |
| Uppercase Letter | 91582 | 18.7% |
| Dash Punctuation | 34323 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 79407 | |
| c | 56471 | |
| h | 34323 | |
| k | 33535 | |
| u | 33535 | |
| t | 33535 | |
| a | 22936 | 6.3% |
| n | 22936 | 6.3% |
| l | 22936 | 6.3% |
| d | 22936 | 6.3% |
| Other values (2) | 2364 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 56471 | |
| O | 33535 | |
| N | 788 | 0.9% |
| S | 788 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34323 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 456496 | |
| Common | 34323 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 79407 | |
| C | 56471 | |
| c | 56471 | |
| h | 34323 | |
| k | 33535 | |
| O | 33535 | |
| u | 33535 | |
| t | 33535 | |
| a | 22936 | 5.0% |
| n | 22936 | 5.0% |
| Other values (6) | 49812 |
Common
| Value | Count | Frequency (%) |
| - | 34323 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 490819 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 79407 | |
| C | 56471 | |
| c | 56471 | |
| h | 34323 | |
| - | 34323 | |
| k | 33535 | |
| O | 33535 | |
| u | 33535 | |
| t | 33535 | |
| a | 22936 | 4.7% |
| Other values (7) | 72748 |
| Distinct | 921 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| Minimum | 2014-11-18 00:00:00 |
|---|---|
| Maximum | 2017-09-14 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
arrival_date
Date
| Distinct | 793 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 894.7 KiB |
| Minimum | 2015-07-01 00:00:00 |
|---|---|
| Maximum | 2017-08-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Interactions
Correlations
| adr | adults | agent | arrival_date_day_of_month | arrival_date_month | arrival_date_week_number | arrival_date_year | assigned_room_type | babies | booking_changes | children | customer_type | days_in_waiting_list | deposit_type | distribution_channel | hotel | is_canceled | is_repeated_guest | lead_time | market_segment | meal | previous_bookings_not_canceled | previous_cancellations | required_car_parking_spaces | reservation_status | reserved_room_type | stays_in_week_nights | stays_in_weekend_nights | total_of_special_requests | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| adr | 1.000 | 0.303 | 0.112 | 0.032 | 0.110 | 0.111 | 0.000 | 0.000 | 0.036 | -0.009 | 0.000 | 0.000 | 0.023 | 0.011 | 0.000 | 0.000 | 0.000 | 0.000 | 0.089 | 0.000 | 0.000 | -0.134 | -0.082 | 0.000 | 0.000 | 0.000 | 0.146 | 0.094 | 0.130 |
| adults | 0.303 | 1.000 | 0.124 | 0.005 | 0.043 | 0.042 | 0.016 | 0.000 | 0.030 | -0.049 | 0.000 | 0.106 | -0.034 | 0.000 | 0.003 | 0.009 | 0.013 | 0.000 | 0.168 | 0.007 | 0.000 | -0.200 | -0.022 | 0.000 | 0.006 | 0.000 | 0.145 | 0.130 | 0.138 |
| agent | 0.112 | 0.124 | 1.000 | -0.012 | -0.042 | -0.046 | 0.227 | 0.117 | 0.024 | 0.026 | 0.080 | 0.179 | -0.062 | 0.143 | 0.142 | 0.616 | 0.165 | 0.059 | 0.088 | 0.263 | 0.186 | -0.123 | -0.022 | 0.081 | 0.121 | 0.128 | 0.213 | 0.215 | 0.246 |
| arrival_date_day_of_month | 0.032 | 0.005 | -0.012 | 1.000 | -0.038 | 0.053 | 0.046 | 0.015 | 0.000 | 0.006 | 0.012 | 0.034 | 0.030 | 0.061 | 0.038 | 0.050 | 0.020 | 0.016 | -0.006 | 0.042 | 0.050 | 0.009 | -0.024 | 0.010 | 0.022 | 0.016 | -0.018 | -0.016 | 0.005 |
| arrival_date_month | 0.110 | 0.043 | -0.042 | -0.038 | 1.000 | 0.995 | 0.422 | 0.046 | 0.009 | -0.002 | 0.077 | 0.124 | 0.023 | 0.123 | 0.085 | 0.234 | 0.168 | 0.123 | 0.110 | 0.111 | 0.102 | -0.070 | 0.052 | 0.030 | 0.132 | 0.060 | 0.026 | 0.020 | 0.018 |
| arrival_date_week_number | 0.111 | 0.042 | -0.046 | 0.053 | 0.995 | 1.000 | 0.419 | 0.047 | 0.008 | -0.002 | 0.070 | 0.118 | 0.028 | 0.118 | 0.092 | 0.248 | 0.182 | 0.128 | 0.109 | 0.104 | 0.101 | -0.070 | 0.048 | 0.030 | 0.142 | 0.056 | 0.023 | 0.019 | 0.015 |
| arrival_date_year | 0.000 | 0.016 | 0.227 | 0.046 | 0.422 | 0.419 | 1.000 | 0.119 | 0.000 | 0.031 | 0.057 | 0.157 | 0.083 | 0.117 | 0.050 | 0.372 | 0.215 | 0.101 | 0.139 | 0.133 | 0.126 | 0.045 | 0.058 | 0.042 | 0.152 | 0.141 | 0.095 | 0.072 | 0.122 |
| assigned_room_type | 0.000 | 0.000 | 0.117 | 0.015 | 0.046 | 0.047 | 0.119 | 1.000 | 0.000 | 0.081 | 0.327 | 0.097 | 0.043 | 0.169 | 0.106 | 0.382 | 0.262 | 0.086 | 0.061 | 0.132 | 0.097 | 0.011 | 0.019 | 0.096 | 0.188 | 0.732 | 0.071 | 0.062 | 0.085 |
| babies | 0.036 | 0.030 | 0.024 | 0.000 | 0.009 | 0.008 | 0.000 | 0.000 | 1.000 | 0.105 | 0.000 | 0.000 | -0.019 | 0.008 | 0.000 | 0.000 | 0.000 | 0.000 | -0.007 | 0.000 | 0.000 | -0.016 | -0.010 | 0.023 | 0.000 | 0.000 | 0.031 | 0.026 | 0.100 |
| booking_changes | -0.009 | -0.049 | 0.026 | 0.006 | -0.002 | -0.002 | 0.031 | 0.081 | 0.105 | 1.000 | 0.022 | 0.035 | -0.024 | 0.022 | 0.030 | 0.041 | 0.057 | 0.000 | 0.028 | 0.023 | 0.018 | 0.025 | -0.030 | 0.018 | 0.040 | 0.016 | 0.095 | 0.058 | 0.055 |
| children | 0.000 | 0.000 | 0.080 | 0.012 | 0.077 | 0.070 | 0.057 | 0.327 | 0.000 | 0.022 | 1.000 | 0.062 | 0.022 | 0.058 | 0.041 | 0.061 | 0.043 | 0.033 | 0.025 | 0.105 | 0.030 | 0.000 | 0.000 | 0.031 | 0.039 | 0.386 | 0.030 | 0.033 | 0.049 |
| customer_type | 0.000 | 0.106 | 0.179 | 0.034 | 0.124 | 0.118 | 0.157 | 0.097 | 0.000 | 0.035 | 0.062 | 1.000 | 0.106 | 0.115 | 0.098 | 0.109 | 0.199 | 0.153 | 0.075 | 0.311 | 0.126 | 0.036 | 0.006 | 0.057 | 0.141 | 0.122 | 0.149 | 0.132 | 0.115 |
| days_in_waiting_list | 0.023 | -0.034 | -0.062 | 0.030 | 0.023 | 0.028 | 0.083 | 0.043 | -0.019 | -0.024 | 0.022 | 0.106 | 1.000 | 0.113 | 0.034 | 0.221 | 0.061 | 0.027 | 0.183 | 0.092 | 0.063 | -0.033 | -0.027 | 0.042 | 0.047 | 0.038 | -0.005 | -0.100 | -0.138 |
| deposit_type | 0.011 | 0.000 | 0.143 | 0.061 | 0.123 | 0.118 | 0.117 | 0.169 | 0.008 | 0.022 | 0.058 | 0.115 | 0.113 | 1.000 | 0.074 | 0.307 | 0.410 | 0.061 | 0.227 | 0.275 | 0.069 | 0.012 | 0.063 | 0.068 | 0.297 | 0.127 | 0.075 | 0.065 | 0.155 |
| distribution_channel | 0.000 | 0.003 | 0.142 | 0.038 | 0.085 | 0.092 | 0.050 | 0.106 | 0.000 | 0.030 | 0.041 | 0.098 | 0.034 | 0.074 | 1.000 | 0.230 | 0.199 | 0.218 | 0.113 | 0.665 | 0.063 | 0.103 | 0.037 | 0.079 | 0.145 | 0.119 | 0.041 | 0.069 | 0.077 |
| hotel | 0.000 | 0.009 | 0.616 | 0.050 | 0.234 | 0.248 | 0.372 | 0.382 | 0.000 | 0.041 | 0.061 | 0.109 | 0.221 | 0.307 | 0.230 | 1.000 | 0.395 | 0.122 | 0.152 | 0.219 | 0.286 | 0.056 | 0.035 | 0.203 | 0.396 | 0.318 | 0.269 | 0.176 | 0.222 |
| is_canceled | 0.000 | 0.013 | 0.165 | 0.020 | 0.168 | 0.182 | 0.215 | 0.262 | 0.000 | 0.057 | 0.043 | 0.199 | 0.061 | 0.410 | 0.199 | 0.395 | 1.000 | 0.126 | 0.239 | 0.228 | 0.141 | 0.064 | 0.057 | 0.272 | 1.000 | 0.091 | 0.068 | 0.043 | 0.218 |
| is_repeated_guest | 0.000 | 0.000 | 0.059 | 0.016 | 0.123 | 0.128 | 0.101 | 0.086 | 0.000 | 0.000 | 0.033 | 0.153 | 0.027 | 0.061 | 0.218 | 0.122 | 0.126 | 1.000 | 0.127 | 0.276 | 0.057 | 0.334 | 0.064 | 0.082 | 0.126 | 0.036 | 0.060 | 0.087 | 0.066 |
| lead_time | 0.089 | 0.168 | 0.088 | -0.006 | 0.110 | 0.109 | 0.139 | 0.061 | -0.007 | 0.028 | 0.025 | 0.075 | 0.183 | 0.227 | 0.113 | 0.152 | 0.239 | 0.127 | 1.000 | 0.175 | 0.091 | -0.180 | 0.087 | 0.071 | 0.179 | 0.046 | 0.394 | 0.247 | -0.075 |
| market_segment | 0.000 | 0.007 | 0.263 | 0.042 | 0.111 | 0.104 | 0.133 | 0.132 | 0.000 | 0.023 | 0.105 | 0.311 | 0.092 | 0.275 | 0.665 | 0.219 | 0.228 | 0.276 | 0.175 | 1.000 | 0.180 | 0.091 | 0.042 | 0.106 | 0.172 | 0.148 | 0.075 | 0.083 | 0.203 |
| meal | 0.000 | 0.000 | 0.186 | 0.050 | 0.102 | 0.101 | 0.126 | 0.097 | 0.000 | 0.018 | 0.030 | 0.126 | 0.063 | 0.069 | 0.063 | 0.286 | 0.141 | 0.057 | 0.091 | 0.180 | 1.000 | 0.015 | 0.088 | 0.029 | 0.106 | 0.074 | 0.090 | 0.076 | 0.054 |
| previous_bookings_not_canceled | -0.134 | -0.200 | -0.123 | 0.009 | -0.070 | -0.070 | 0.045 | 0.011 | -0.016 | 0.025 | 0.000 | 0.036 | -0.033 | 0.012 | 0.103 | 0.056 | 0.064 | 0.334 | -0.180 | 0.091 | 0.015 | 1.000 | 0.120 | 0.026 | 0.045 | 0.008 | -0.111 | -0.090 | 0.023 |
| previous_cancellations | -0.082 | -0.022 | -0.022 | -0.024 | 0.052 | 0.048 | 0.058 | 0.019 | -0.010 | -0.030 | 0.000 | 0.006 | -0.027 | 0.063 | 0.037 | 0.035 | 0.057 | 0.064 | 0.087 | 0.042 | 0.088 | 0.120 | 1.000 | 0.000 | 0.041 | 0.013 | 0.005 | 0.004 | -0.036 |
| required_car_parking_spaces | 0.000 | 0.000 | 0.081 | 0.010 | 0.030 | 0.030 | 0.042 | 0.096 | 0.023 | 0.018 | 0.031 | 0.057 | 0.042 | 0.068 | 0.079 | 0.203 | 0.272 | 0.082 | 0.071 | 0.106 | 0.029 | 0.026 | 0.000 | 1.000 | 0.192 | 0.080 | 0.038 | 0.044 | 0.060 |
| reservation_status | 0.000 | 0.006 | 0.121 | 0.022 | 0.132 | 0.142 | 0.152 | 0.188 | 0.000 | 0.040 | 0.039 | 0.141 | 0.047 | 0.297 | 0.145 | 0.396 | 1.000 | 0.126 | 0.179 | 0.172 | 0.106 | 0.045 | 0.041 | 0.192 | 1.000 | 0.066 | 0.052 | 0.035 | 0.156 |
| reserved_room_type | 0.000 | 0.000 | 0.128 | 0.016 | 0.060 | 0.056 | 0.141 | 0.732 | 0.000 | 0.016 | 0.386 | 0.122 | 0.038 | 0.127 | 0.119 | 0.318 | 0.091 | 0.036 | 0.046 | 0.148 | 0.074 | 0.008 | 0.013 | 0.080 | 0.066 | 1.000 | 0.076 | 0.067 | 0.089 |
| stays_in_week_nights | 0.146 | 0.145 | 0.213 | -0.018 | 0.026 | 0.023 | 0.095 | 0.071 | 0.031 | 0.095 | 0.030 | 0.149 | -0.005 | 0.075 | 0.041 | 0.269 | 0.068 | 0.060 | 0.394 | 0.075 | 0.090 | -0.111 | 0.005 | 0.038 | 0.052 | 0.076 | 1.000 | 0.428 | 0.101 |
| stays_in_weekend_nights | 0.094 | 0.130 | 0.215 | -0.016 | 0.020 | 0.019 | 0.072 | 0.062 | 0.026 | 0.058 | 0.033 | 0.132 | -0.100 | 0.065 | 0.069 | 0.176 | 0.043 | 0.087 | 0.247 | 0.083 | 0.076 | -0.090 | 0.004 | 0.044 | 0.035 | 0.067 | 0.428 | 1.000 | 0.103 |
| total_of_special_requests | 0.130 | 0.138 | 0.246 | 0.005 | 0.018 | 0.015 | 0.122 | 0.085 | 0.100 | 0.055 | 0.049 | 0.115 | -0.138 | 0.155 | 0.077 | 0.222 | 0.218 | 0.066 | -0.075 | 0.203 | 0.054 | 0.023 | -0.036 | 0.060 | 0.156 | 0.089 | 0.101 | 0.103 | 1.000 |
Missing values
Sample
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | arrival_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Resort Hotel | 0 | 342 | 2015 | 7 | 27 | 1 | 0 | 0 | 2 | 0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 3 | No Deposit | 0.0 | 0 | 0.0 | Transient | 0.0 | 0.0 | 0.0 | Check-Out | 2015-07-01 | 2015-07-01 |
| 1 | Resort Hotel | 0 | 737 | 2015 | 7 | 27 | 1 | 0 | 0 | 2 | 0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 4 | No Deposit | 0.0 | 0 | 0.0 | Transient | 0.0 | 0.0 | 0.0 | Check-Out | 2015-07-01 | 2015-07-01 |
| 2 | Resort Hotel | 0 | 7 | 2015 | 7 | 27 | 1 | 0 | 1 | 1 | 0 | 0 | BB | GBR | Direct | Direct | 0 | 0 | 0 | A | C | 0 | No Deposit | 0.0 | 0 | 0.0 | Transient | 75.0 | 0.0 | 0.0 | Check-Out | 2015-07-02 | 2015-07-01 |
| 3 | Resort Hotel | 0 | 13 | 2015 | 7 | 27 | 1 | 0 | 1 | 1 | 0 | 0 | BB | GBR | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | 304.0 | 0 | 0.0 | Transient | 75.0 | 0.0 | 0.0 | Check-Out | 2015-07-02 | 2015-07-01 |
| 4 | Resort Hotel | 0 | 14 | 2015 | 7 | 27 | 1 | 0 | 2 | 2 | 0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0 | 0.0 | Transient | 98.0 | 0.0 | 1.0 | Check-Out | 2015-07-03 | 2015-07-01 |
| 5 | Resort Hotel | 0 | 14 | 2015 | 7 | 27 | 1 | 0 | 2 | 2 | 0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0 | 0.0 | Transient | 98.0 | 0.0 | 1.0 | Check-Out | 2015-07-03 | 2015-07-01 |
| 6 | Resort Hotel | 0 | 0 | 2015 | 7 | 27 | 1 | 0 | 2 | 2 | 0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | 0.0 | 0 | 0.0 | Transient | 107.0 | 0.0 | 0.0 | Check-Out | 2015-07-03 | 2015-07-01 |
| 7 | Resort Hotel | 0 | 9 | 2015 | 7 | 27 | 1 | 0 | 2 | 2 | 0 | 0 | FB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | 303.0 | 0 | 0.0 | Transient | 103.0 | 0.0 | 1.0 | Check-Out | 2015-07-03 | 2015-07-01 |
| 8 | Resort Hotel | 1 | 85 | 2015 | 7 | 27 | 1 | 0 | 3 | 2 | 0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0 | 0.0 | Transient | 82.0 | 0.0 | 1.0 | Canceled | 2015-05-06 | 2015-07-01 |
| 9 | Resort Hotel | 1 | 75 | 2015 | 7 | 27 | 1 | 0 | 3 | 2 | 0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 15.0 | 0 | 0.0 | Transient | 105.5 | 0.0 | 0.0 | Canceled | 2015-04-22 | 2015-07-01 |
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | arrival_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 58884 | City Hotel | 1 | 605 | 2016 | 10 | 43 | 17 | 1 | 2 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0 | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | 2016-10-17 |
| 58885 | City Hotel | 1 | 605 | 2016 | 10 | 43 | 17 | 1 | 2 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0 | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | 2016-10-17 |
| 58886 | City Hotel | 1 | 605 | 2016 | 10 | 43 | 17 | 1 | 2 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0 | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | 2016-10-17 |
| 58887 | City Hotel | 1 | 605 | 2016 | 10 | 43 | 17 | 1 | 2 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0 | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | 2016-10-17 |
| 58888 | City Hotel | 1 | 605 | 2016 | 10 | 43 | 17 | 1 | 2 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0 | 0.0 | Transient | 60.00 | 0.0 | 0.0 | Canceled | 2016-09-20 | 2016-10-17 |
| 58890 | Resort Hotel | 0 | 3 | 2016 | 4 | 16 | 11 | 1 | 0 | 1 | 0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | 0 | 0.0 | Transient-Party | 56.00 | 0.0 | 1.0 | Check-Out | 2016-04-12 | 2016-04-11 |
| 58891 | Resort Hotel | 1 | 158 | 2016 | 5 | 20 | 8 | 2 | 2 | 2 | 0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | F | F | 2 | No Deposit | 250.0 | 0 | 0.0 | Transient | 83.05 | 0.0 | 1.0 | Canceled | 2016-01-21 | 2016-05-08 |
| 58892 | City Hotel | 1 | 18 | 2016 | 8 | 32 | 6 | 2 | 2 | 2 | 0 | 0 | BB | ESP | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | 0 | 0.0 | Transient | 151.00 | 0.0 | 2.0 | Canceled | 2016-07-28 | 2016-08-06 |
| 58893 | Resort Hotel | 1 | 383 | 2016 | 10 | 41 | 6 | 1 | 3 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 315.0 | 0 | 0.0 | Transient-Party | 48.00 | 0.0 | 0.0 | Canceled | 2016-03-04 | 2016-10-06 |
| 58894 | City Hotel | 1 | 185 | 2016 | 7 | 28 | 5 | 0 | 4 | 2 | 0 | 0 | BB | DEU | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 9.0 | 0 | 0.0 | Transient | 90.95 | 0.0 | 1.0 | Canceled | 2016-05-31 | 2016-07-05 |
Duplicate rows
Most frequently occurring
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | arrival_date | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1312 | City Hotel | 1 | 188 | 2016 | 6 | 25 | 15 | 0 | 2 | 1 | 0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | 39.0 | Transient | 130.0 | 0.0 | 0.0 | Canceled | 2016-01-18 | 2016-06-15 | 92 |
| 740 | City Hotel | 1 | 37 | 2016 | 10 | 42 | 13 | 0 | 3 | 2 | 0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 56.0 | 0.0 | Transient-Party | 105.0 | 0.0 | 0.0 | Canceled | 2016-09-06 | 2016-10-13 | 79 |
| 1224 | City Hotel | 1 | 158 | 2016 | 5 | 22 | 24 | 0 | 2 | 1 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | 31.0 | Transient | 130.0 | 0.0 | 0.0 | Canceled | 2016-01-18 | 2016-05-24 | 79 |
| 744 | City Hotel | 1 | 39 | 2015 | 8 | 33 | 14 | 0 | 2 | 2 | 0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 6.0 | 0.0 | Transient-Party | 101.5 | 0.0 | 0.0 | Canceled | 2015-07-06 | 2015-08-14 | 69 |
| 893 | City Hotel | 1 | 71 | 2016 | 6 | 25 | 14 | 0 | 3 | 1 | 0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | 0.0 | Transient | 120.0 | 0.0 | 0.0 | Canceled | 2016-04-27 | 2016-06-14 | 69 |
| 554 | City Hotel | 1 | 1 | 2016 | 2 | 10 | 28 | 2 | 1 | 1 | 0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 134.0 | 0.0 | Transient-Party | 60.0 | 0.0 | 0.0 | Canceled | 2016-02-27 | 2016-02-28 | 64 |
| 966 | City Hotel | 1 | 87 | 2015 | 9 | 39 | 25 | 2 | 3 | 2 | 0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 1.0 | 0.0 | Transient | 170.0 | 0.0 | 0.0 | Canceled | 2015-09-09 | 2015-09-25 | 59 |
| 818 | City Hotel | 1 | 56 | 2016 | 6 | 24 | 8 | 0 | 1 | 2 | 0 | 0 | BB | PRT | Offline TA/TO | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | 191.0 | 0.0 | Transient-Party | 120.0 | 0.0 | 0.0 | Canceled | 2016-06-02 | 2016-06-08 | 58 |
| 910 | City Hotel | 1 | 74 | 2015 | 9 | 38 | 18 | 0 | 2 | 2 | 0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 6.0 | 0.0 | Transient-Party | 101.5 | 0.0 | 0.0 | Canceled | 2015-07-06 | 2015-09-18 | 54 |
| 1058 | City Hotel | 1 | 105 | 2016 | 4 | 15 | 6 | 0 | 1 | 2 | 0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 12.0 | 0.0 | Transient | 75.0 | 0.0 | 0.0 | Canceled | 2016-01-18 | 2016-04-06 | 53 |